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ABSTRACT 


The  purpose  of  this  thesis  is  to  analyze  historical  United  States  Marine  Corps  enlisted 
attrition  behavior  and  apply  time  series  forecasting  techniques  by  grade  and  Years  of 
Service  in  order  to  identify  methods  to  improve  manpower  analysts’  ability  to  effectively 
forecast  attrition  behavior.  This  study  compared  the  results  of  one  to  five-year  Moving 
Average  models  and  the  results  of  one  to  five-year  Weighted  Moving  Average  models 
based  on  two  Measures  of  Effectiveness,  Mean  Square  Error  and  the  Mean  Absolute 
Percent  Error.  The  results  of  the  Friedman  test  indicate  statistical  significance  of  the 
results  in  relation  to  the  Mean  Square  Error  of  the  one  to  two-year  Moving  Average 
models.  This  thesis  demonstrates  that  in  most  cases,  a  simple  one -year  Moving  Average 
more  effectively  estimates  attrition  behavior  than  the  other  Moving  Average  or  Weighted 
Moving  Average  models  analyzed. 
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EXECUTIVE  SUMMARY 


Manpower  and  personnel  management  costs  are  significant  in  the  Marine  Corps.  The 
mismanagement  of  any  area  in  the  field  of  manpower  analysis  can  impact  the  operational 
readiness  of  the  entire  Marine  Corps.  The  Marine  Corps  must  now  operate  in  a  more 
fiscally  constrained  environment  and  continue  to  provide  the  same  level  of  effectiveness 
on  the  battlefield.  Over  90%  of  the  Marine  Corps’  total  force  are  enlisted  Marines  and  as 
a  result,  the  accurate  management  of  these  personnel  is  most  critical.  As  a  portion  of  the 
manpower  planning  process,  the  forecasted  attrition  of  enlisted  personnel  is  required  to 
effectively  execute  related  tasks  by  manpower  analysts. 

The  purpose  of  this  thesis  is  to  analyze  historical  United  States  Marine  Corps 
enlisted  attrition  behavior  and  apply  time  series  forecasting  techniques  by  grade  and 
Years  of  Service  in  order  to  identify  methods  to  improve  manpower  analysts’  ability  to 
effectively  forecast  attrition  behavior.  The  scope  of  this  study  is  limited  to  active  duty 
enlisted  Marines  in  the  grades  of  E-l  through  E-9,  and  Years  of  Service  between  one  and 
thirty  years.  Observed  attrition  behavior  is  used  as  the  basis  of  accuracy  for  the  Moving 
Average  and  Weighted  Moving  Average  models  according  to  two  Measures  of 
Effectiveness,  Mean  Square  Error  and  Mean  Absolute  Percent  Error.  The  difference  in 
model  performance  is  measured  for  statistical  significance  utilizing  the  Friedman  Test. 

This  thesis  demonstrates  that  in  most  cases,  a  simple  one -year  Moving  Average 
model  more  effectively  estimates  attrition  behavior  than  other  Moving  Average  or 
Weighted  Moving  Average  models.  Based  on  this  analysis,  the  recommendation  to  MPP- 
20  and  MPP-50  is  that  the  use  of  a  one-year  Moving  Average  model  is  the  most  effective 
way  to  estimate  enlisted  attrition  rates  in  the  Marine  Corps  by  grade  and  Years  of 
Service,  regardless  of  the  Measure  of  Effectiveness  of  either  the  Mean  Square  Error  or 
the  Mean  Absolute  Percent  Error. 
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I.  INTRODUCTION 


A.  BACKGROUND 

Manpower  and  personnel  management  costs  consume  significant  portions  of  the 
Military  Personnel,  Marine  Corps  (MPMC)  budget.  More  importantly,  however,  the 
consequences  of  inaccuracy  in  these  areas  can  have  dramatic  results  on  the  operational 
readiness  of  the  Marine  Corps.  The  Fiscal  Year  (FY)  11  end  strength  in  the  Marine 
Corps  is  202,100  Marines,  and  approximately  90  percent  of  that  total  force  are  enlisted 
Marines.  The  FY1 1  MPMC  budget  is  $13.3  billion,  which  is  approximately  50  percent  of 
the  Marines  Corps’  total  baseline  Total  Obligation  Authority  (TOA)  (Concepts  & 
Programs  2010).  Of  the  13  separate  budget  accounts  in  the  Marines  Corps,  the  MPMC 
account  is  the  primary  account  to  receive  full  obligation  whether  or  not  it  was  budgeted 
correctly,  which  can  result  in  partial  obligation  in  any  of  the  other  12  accounts.  This  was 
the  case  when  the  over  budget  of  FY’s  01-02  end  strength  resulted  in  the  re-allocation  of 
$200  million  from  the  Operations  and  Maintenance  (O&M)  account  (Hattiangadi, 
Kimble,  Lambert,  Quester  2005).  This  type  of  miscalculation  is  costly,  not  only  in 
budgeted  dollars,  but  the  impact  on  operational  readiness  of  the  entire  Marine  Corps  can 
suffer  more  significant  consequences  while  engaged  in  combat  operations  overseas. 

The  Marine  Corps  must  be  able  to  operate  in  a  more  fiscally  and  personnel 
constrained  environment  than  what  the  Corps  is  accustomed  to,  based  on  the  past  decade 
of  combat  operations  overseas.  In  his  testimony  to  the  House  Armed  Services  Committee 
on  1  March  2011,  General  Amos  stated,  “The  Marine  Corps  is  re -posturing  and 
rebalancing  for  the  future.”  He  also  introduces  the  tenn  “middle-weight  force”  to 
describe  the  capability  that  is  currently  missing  between  the  special  operations  forces  and 
conventional  units.  The  Marine  Corps  will  fill  that  gap  and,  as  a  result,  “The  drawdown 
of  our  active  component  from  202,100  to  186,000  must  be  conditions  based,  and  only 
after  completion  of  our  mission  in  Afghanistan”  (Amos  2011). 

The  reduction  of  these  approximately  16,000  active  duty  Marines  is  not  a  simple 

task  and  must  be  effectively  managed  by  manpower  analysts.  In  preparation  of  these 
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reductions,  the  enlisted  manpower  analysts  working  in  the  Manpower  Plans,  Programs,  & 
Budget  branch  (MPP),  within  the  Manpower  and  Reserve  Affairs  (M&RA)  department, 
play  a  vital  role  in  any  manpower  and  personnel  analysis.  Due  to  the  large  proportion  of 
enlisted  Marines,  Enlisted  Plans  (MPP-20)  and  Integration  and  Analysis  (MPP-50) 
sections  are  the  lead  agencies  dealing  directly  with  the  effects  of  reductions  on  the  total 
force.  Reducing  end  strength  by  any  significant  amount  of  Marines  will  have  important 
consequences  on  the  budget,  but  finding  the  appropriate  time  frame  to  reduce  the  force 
while  minimizing  the  adverse  impacts  on  retention,  promotion  and  retirement  within  the 
Marine  Corps  is  vital. 

The  application  of  time  series  forecasting  techniques  to  analyze  enlisted  attrition 
behavior  by  grade  and  Years  of  Service  (YOS)  is  an  important  part  of  the  manpower 
management  process.  This  thesis  demonstrates  that  in  most  cases,  a  simple  one -year 
Moving  Average  (MA)  more  effectively  estimates  attrition  behavior  than  other  MA  or 
Weighted  Moving  Average  (WMA)  models. 

B.  PURPOSE 

The  purpose  of  this  thesis  is  to  analyze  historical  United  States  Marine  Corps 
(USMC)  enlisted  attrition  behavior  and  apply  time  series  forecasting  techniques  by  grade 
and  YOS  in  order  to  identify  methods  to  improve  manpower  analysts’  ability  to 
effectively  forecast  attrition  behavior.  The  primary  research  questions  that  will  focus  this 
analysis  are: 

1.  Of  the  techniques  most  accessible  to  manpower  analysts,  which  best  forecast 
enlisted  attrition  behavior  in  the  Marine  Corps  by  grade  and  YOS? 

2.  How  does  the  choice  of  technique  depend  on  the  measure  of  effectiveness? 

C.  SCOPE  AND  METHODOLOGY 

This  study  analyzes  enlisted  attrition  behavior  utilizing  time  series  forecasting 
techniques  based  on  grade  and  YOS  combinations.  The  scope  of  this  study  is  limited  to 
active  duty  enlisted  Marines  categorized  in  the  grades  of  E-l  through  E-9,  and  YOS 

between  one  and  thirty  years.  This  study  applies  the  service  limits  for  grades  E-4  through 
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E-9  found  in  the  Enlisted  Career  Force  Controls  (ECFC)  in  order  to  standardize  the 
estimates.  Observed  attrition  rates  serve  as  the  baseline  against  which  the  model 
performance  is  measured.  For  the  purpose  of  this  study,  the  tenn  attrition  is  defined  as 
any  enlisted  Marine  that  leaves  active  duty,  regardless  of  the  reasoning.  Below  are  the 
categories  of  attrition  used  in  this  study. 

1 .  End  of  Active  Service  (EAS) 

a.  First  Term  -  Enlisted  Marines  who  finish  their  initial  obligated 
enlistment  and  do  not  re-enlist. 

b.  Intermediate  -  Enlisted  Marines  who  have  re-enlisted  at  least  once, 
but  get  out  before  a  third  re-enlistment  (4-13  years). 

c.  Careerists  -  Enlisted  Marines  who  have  re-enlisted  at  least  three 
times  (14-19  years). 

2.  Non-End  of  Active  Service  (NEAS) 

a.  Recruit  losses  -  Recruits  who  do  not  graduate  from  recruit 
training. 

b.  Medical  discharge  -  Enlisted  Marines  who  are  medically  separated 
from  the  Marine  Corps  prior  to  their  EAS. 

c.  Administrative  separation  -  Enlisted  Marines  who  are 

administratively  separated  from  the  Marine  Corps  prior  to  their 
EAS. 

d.  Punitive  discharge  -  Enlisted  Marines  who  are  punitively 
discharged  from  the  Marine  Corps  prior  to  their  EAS. 

e.  Deserter  losses  -  Enlisted  Marines  who  are  on  Unauthorized 
Absence  status  for  30  consecutive  days. 

f.  Death 
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3.  Enlisted  to  Officer  Transitions  -  Enlisted  Marines  who  accept  a  commission  in 
the  Marine  Corps. 

4.  Other  Losses  -  Any  other  loss  not  categorized  above. 

D.  ORGANIZATION  OF  THE  STUDY 

Chapter  I  introduces  the  thesis  research  topic  and  covers  the  background,  purpose 
of  the  research  and  the  scope  and  methodology  behind  this  study.  Chapter  II  provides  a 
literature  review  of  previous  research  that  relates  to  this  thesis  topic  that  influence 
decisions  and  assumptions  made  during  this  study.  Chapter  III  introduces  the  data  and 
analysis  software  used  to  calculate  enlisted  attrition  rates  by  grade  and  YOS.  This 
chapter  also  describes  in  detail  the  methodology  behind  each  step  during  the  data  analysis 
portion  of  this  research.  Chapter  IV  discusses  the  results  found  after  the  completion  of 
the  data  analysis  and  applies  the  Friedman  test  to  determine  significance  of  those  results. 
Chapter  V  summarizes  the  findings  from  Chapter  IV  and  makes  recommendations  to 
MPP-20  and  MPP-50  in  regards  to  these  findings. 
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II.  LITERATURE  REVIEW 


A.  INTRODUCTION  OF  PREVIOUS  STUDIES 

Before  further  discussion  of  this  study,  an  overview  of  previous  attrition  literature 
that  influenced  this  study  is  necessary.  There  are  a  number  of  attrition  and  loss  studies 
about  the  Marine  Corps.  The  tenn  loss  is  synonymous  with  the  tenn  attrition,  with  the 
latter  being  used  primarily  in  the  remainder  of  this  study.  Throughout  the  research  of 
these  studies,  many  attempt  to  predict  the  future  attrition  behavior  of  Marines  by  using 
known  variables  of  the  individuals’  demographic  profile  combined  with  their  previous 
enlistment  behavior.  The  ability  to  predict  future  behavior  based  on  these  known 
variables  is  possible  in  many  cases  when  utilizing  multivariate  regression  modeling 
techniques,  but  the  accuracy  of  these  predictions  are  influenced  by  unobservable 
variables  that  cannot  be  accounted  for  in  these  prediction  models.  In  a  military  context, 
the  choice  of  an  individual  to  behave  a  certain  way  that  directly  affects  their  probability 
of  attrition  can  be  difficult  to  account  for  in  these  prediction  models.  As  seen  in  Chapter 
I,  there  are  a  number  of  reasons  for  attrition  in  the  Marine  Corps  and  the  ability  to 
effectively  forecast  these  losses  on  the  total  force  are  important  in  achieving  operational 
readiness  on  the  battlefield.  The  following  studies  are  similar  in  their  attempts  to  predict 
attrition  behavior  in  the  Marine  Corps,  but  differentiate  themselves  in  the  approaches 
taken  to  achieve  that  objective.  The  aspects  of  each  study  that  directly  influence  the 
decisions  made  in  this  thesis  are  thoroughly  discussed  in  the  following  sections. 

B.  HATTIANGADI,  KIMBLE,  LAMBERT  AND  QUESTER  (2005) 

Prior  to  discussing  previous  studies  on  forecasting  enlisted  attrition  in  the  Marine 
Corps,  it  is  necessary  to  first  understand  the  current  manpower  planning  process  used  in 
the  Marine  Corps.  A  Center  for  Naval  Analysis  (CNA)  report  completed  in  2005 
provides  a  thorough  analysis  of  the  enlisted  manpower  planning  process  currently  used  in 
the  Marine  Corps.  This  report  analyzes  the  existing  loss  forecasting  methods  used  by 
manpower  planners  in  Quantico,  Virginia,  assesses  the  effectiveness  of  those  methods  in 
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order  to  make  improvements  to  those  models,  and  documents  this  improved  manpower 
management  process  for  future  reference  by  manpower  planners.  The  CNA  analysis  also 
looks  at  the  officer  manpower  plan  model,  but  this  thesis  focuses  on  the  enlisted 
manpower  plan  model  portion  of  the  CNA  study.  The  purpose  of  including  this  report  is 
to  provide  a  basic  understanding  of  the  enlisted  manpower  planning  process  in  the 
Marine  Corps  and  to  identify  the  methods  currently  employed  by  the  manpower  analysts 
to  forecast  enlisted  attrition  behavior. 

The  CNA  study  is  a  response  for  the  need  for  an  accurate  manpower  forecasting 
process  in  the  Marine  Corps.  This  is  because  of  the  large  proportion  of  the  Marine  Corps 
budget  that  is  spent  on  personnel  “Manpower  costs  are  about  $9.4  billion  annually,  or 
almost  60  percent  of  the  Marine  Corps’  annual  budget”  (5)  and  the  costly  results  of 
inaccurate  forecasts. 

Estimates  had  been  incorrect  in  the  past  due  to  the  ad  hoc  nature  of  the 
loss  forecasting  processes.  Previously  there  was  no  institutionalized  and 
documented  methodology  for  forecasting  losses  and  no  systematic  attempt 
to  improve  existing  loss-forecasting  techniques.  New  planners  relied  on 
information  they  gleaned  during  overlap  period  with  their  predecessors 
and  sometimes  developed  their  own  methods  (1). 

This  documented  history  of  inconsistent  forecasting  of  attrition  behavior  in  the  Marine 
Corps  provides  the  reasoning  behind  a  review  of  the  entire  manpower  planning  process 
being  used  at  the  time  of  this  study  in  2005.  The  following  paragraphs  summarize  the 
enlisted  manpower  plan  model  explained  in  the  study. 

The  CNA  report  discusses  some  fundamental  definitions  and  congressionally 
mandated  requirements  placed  on  the  Marine  Corps.  The  authors  begin  the  analysis  by 
explaining  Title  X  end  strength  rules  and  the  applicability  to  the  Marine  Corps  manpower 
planning  process.  As  defined  in  the  study,  end  strength  is  the  number  of  service  members 
in  a  particular  service  on  the  last  day  of  the  FY,  30  September.  Title  X  allows  each 
service  to  exceed  end  strength  by  two  to  three  percent.  Current  Marine  Corps  policy  sets 
the  maximum  percentage  of  those  who  can  be  in  the  top  six  enlisted  grades  at  54  percent. 
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This  congressionally  mandated  end  strength  target  applies  to  the  sum  of  active-duty 
Marine  Corps  officers  and  enlisted  personnel.  The  fundamental  end  strength  equation  is: 

Beginning  strength  -  Losses  +  Gains  =  End  strength 

The  end  strength  at  the  end  of  the  previous  FY  is  the  beginning  end  strength  of  the  next 
FY. 

Before  discussing  the  manpower  planning  process  in  the  Marine  Corps,  an 
understanding  of  the  basic  components  of  the  enlisted  end  strength  model  is  required. 
The  authors  describe  the  six  components  and  the  sub-components  of  the  enlisted 
manpower  plan  model.  Chapter  I  of  this  study  contains  the  definitions  of  four  of  the  six 
components;  this  section  will  define  the  adjustments  and  gains  models  when  appropriate. 
Figure  1  contains  the  six  components  of  the  complete  manpower  plan  model.  Each 
model  is  forecasted  separately  by  month  and  grade. 


Figure  1.  Marine  Corps  Enlisted  End  Strength  Models 

The  EAS  Loss  Model  is  the  most  significant  portion  of  the  manpower  plan  model 
and  requires  special  attention.  This  is  because  EAS  losses  account  for  over  half  of  the 
active  duty  enlisted  losses.  These  losses  are  broken  down  by  first  term,  intermediate  and 

careerists.  First  tenn  EAS  losses  are  managed  by  MPP-20  utilizing  the  First  Term 
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Alignment  Plan  (FTAP).  The  FTAP  is  a  steady  state  model  that  determines  the  number 
of  reenlistments  by  Primary  Military  Occupational  Specialty  (PMOS).  Each  requirement 
is  a  “boatspace,”  and  recommended  first  term  Marines  cannot  reenlist  without  an 
available  boatspace  in  that  PMOS.  After  calculation  of  the  execution  FY  FTAP,  or  the 
number  of  first  tenn  Marines  who  will  stay  in  the  Corps  in  the  execution  FY,  the  enlisted 
strength  planners  apply  a  three-year  average  of  previous  monthly  FTAP  distributions  to 
determine  the  percentage  of  Marines  who  will  stay  across  the  months  in  the  execution 
FY.  This  percentage  is  multiplied  by  the  FTAP  for  the  execution  FY  by  month  and  the 
resulting  number  is  the  forecasted  number  of  first  term  Marines  staying  in  the  Corps. 
Intermediate  and  careerist  EAS  losses  are  calculated  differently  than  first  tenn  EAS 
losses  because  all  eligible  Marines  within  these  categories  are  allowed  to  reenlist. 
Intermediate  and  careerist  losses  are  calculated  by  using  the  straight-line  average  of  the 
previous  three  years  of  continuation  rates  at  YOS  4-19.  These  rates  are  applied  to  the 
EAS  population  in  the  execution  FY  by  month  in  order  to  calculate  the  number  of 
Marines  remaining  in  the  Corps  in  these  two  zones.  After  calculation  of  all  three  sub¬ 
components  of  the  EAS  loss  model,  the  monthly  EAS  losses  are  phased  by  grade. 
Utilizing  a  weighted  average  of  the  historical  grade  distribution  of  EAS  losses,  enlisted 
strength  planners  set  the  weighted  average  with  up  to  four  previous  years’  data  and  set 
unequal  weights  within  non-consecutive  years  if  necessary.  This  weight  differs  by  grade 
and  is  applied  to  total  EAS  losses  by  month  in  order  to  calculate  the  total  amount  of 
Marines  remaining  on  active  duty  in  the  EAS  loss  model. 

The  next  most  important  portion  of  the  manpower  plan  model  is  the  NEAS  Loss 
Model.  The  NEAS  losses  account  for  46  percent  of  all  enlisted  losses  and  include  recruit, 
retirement,  and  category  losses.  Recruit  losses  occur  at  either  of  the  two  Marine  Corps 
Recruit  Depots  (MCRD)  and  are  calculated  by  gender.  The  first  step  prior  to  estimating 
recruit  losses  is  to  take  into  account  recruit  accession  phasing  which  is  established  by 
Marine  Corps  Recruiting  Command  (MCRC)  by  trimester.  To  forecast  recruit  phasing 
rates  in  the  execution  FY,  enlisted  strength  planners  compute  a  four-year  weighted 
average  of  historical  monthly  phasing  rates  by  month  and  gender.  Next,  the  estimated 
number  of  prior  service  contracts  must  be  subtracted  from  both  the  male  and  female 
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accession  numbers  because  these  individuals  are  not  required  to  go  through  recruit 
training.  The  planners  must  phase  these  male  and  female  net  accession  numbers  over  the 
execution  FY  by  multiplying  the  net  accession  number  by  the  monthly  accession  phasing 
rate  estimated  for  the  FY.  Lastly  enlisted  strength  planners  must  forecast  recruit  loss 
rates  by  month  and  gender  in  order  to  phase  losses  over  the  execution  year.  Again, 
historical  loss  rates  from  the  previous  four  years  are  used  and  averaged  to  estimate  loss 
rates  for  the  next  FY.  In  order  to  forecast  retirement  losses,  enlisted  end  strength 
planners  take  the  average  previous  four  years  historical  number  of  retirements  in 
comparison  to  the  actual  number  of  retirement  packages  submitted  during  the  previous 
FY.  This  requested  and  actual  differential  in  retirement  rates  is  used  in  the  calculation  of 
forecasted  retirement  losses  and  is  distributed  by  month  in  concert  with  the  average  from 
the  previous  four  years  actual  retirements  by  month.  Category  losses  are  defined  as 
losses  that  occur  after  recruit  training  that  are  not  counted  as  EAS  or  retirement  losses. 
All  of  these  category  losses  are  forecasted  together  by  month  utilizing  a  weighted  average 
of  the  previous  three  years’  category  losses  or  Monte  Carlo  simulations. 

The  Other  Loss  Model  is  utilized  to  account  for  enlisted  Marines  that  are  no 
longer  on  active  duty,  but  have  no  loss  code  associated  with  that  loss.  The  enlisted 
strength  planners  use  a  four-year  weighted  average  of  historical  “other  loss”  data  in  order 
to  forecast  these  losses. 

The  Enlisted  to  Officer  Model  accounts  for  the  number  of  active  duty  enlisted 
Marines  who  receive  a  commission  in  the  Marine  Corps,  which  subsequently  increases 
the  number  of  officers,  but  decreases  the  number  of  enlisted  Marines.  Also,  the  civilians 
attending  Officer  Candidates  School  (OCS)  are  paid  as  E-5s  in  the  Marine  Corps  while 
attending  OCS.  Consequently,  the  civilians  that  do  complete  OCS  and  receive  a 
commission  or  do  not  complete  OCS  must  be  counted  as  enlisted  losses. 

The  Gains  Model  encompasses  all  non-prior  service  accessions,  prior  service 
accessions,  deserters  and  other  gains.  The  majority  of  these  gains  are  non-prior  service 
accessions,  which  are  not  forecasted  by  the  enlisted  strength  planners,  but  managed.  All 
other  gains  components  are  forecasted  the  same  by  using  a  four-year  weighted  average 
and  Monte  Carlo  simulations. 
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The  Other  Adjustments  Model  is  the  last  phase  of  the  enlisted  manpower  planning 
process  and  ensures  that  the  end  strength  goal  for  the  given  FY  is  met  after  all  the  losses 
and  gains  have  been  forecast  from  the  models  described  previously.  The  enlisted  strength 
planners  add  in  accessions  to  enlisted  end  strength  and  adjust  it  as  necessary  in  order  to 
achieve  the  mandated  end  strength  number  on  the  last  day  of  the  FY. 

C.  ORRICK  (2008) 

This  Naval  Postgraduate  School  (NPS)  thesis  from  2008  develops  a  regression 
modeling  technique  to  forecast  NEAS  attrition.  The  study  utilizes  a  logistic  regression 
technique  that  identifies  attributes  of  the  individual  Marines’  demographic  profile  that  are 
more  likely  to  be  associated  with  NEAS  losses.  The  findings  of  the  study  predict  NEAS 
losses  for  FY  2005-2007  with  greater  than  76  percent  accuracy  and  misclassify  EAS 
separations  as  NEAS  losses  at  a  rate  below  25  percent.  The  purpose  of  including  this 
thesis  is  to  analyze  a  typical  regression  technique  used  in  many  attrition  studies  in  order 
to  identify  the  strengths  and  weaknesses  of  this  type  of  analysis. 

One  weakness  in  the  thesis  is  the  significant  reduction  in  the  number  of 
observations  from  data  collection  to  final  analysis.  The  data  is  from  the  TFDW  and 
consists  of  three  sets  of  data.  The  first  data  set  encompasses  all  enlisted  losses  from  1 
October  1997  to  30  April  2007.  The  second  data  set  captures  all  enlisted  accessions 
during  that  same  period  and  the  final  data  set  is  a  snapshot  of  the  enlisted  end  strength  on 
30  September  1997.  All  three  data  sets  totaled  587,154  entries,  but  after  cleaning  and 
coding,  the  final  data  set  consisted  of  167,269  observations.  This  large  difference  is  due 
to  missing  variables  in  a  number  of  observations  and  is  a  common  weakness  in  these 
types  of  regression  techniques.  Although  not  a  fault  on  part  of  the  researcher,  this 
significant  reduction  in  observations  degrades  the  validity  of  the  data  in  this  thesis. 

Another  negative  effect  that  the  missing  variables  have  on  the  thesis  is  in  the 
application  of  the  logistic  regression  model.  The  logistic  regression  model  consists  of  the 
binary  dependent  variable  of  attrition  and  5 1  independent  variables  that  explain  attrition 
behavior.  The  independent  variables  are  personal  and  professional  demographic 
information  extracted  from  TFDW,  and  are  the  cause  of  the  large  reduction  in 
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observations  explained  previously.  Over  126,000  observations  are  missing  separation 
codes,  and  were  consequently  deleted  from  the  data  set.  The  author  notes  that  this 
amount  of  missing  observations  “may  have  an  influence  on  the  outcome  of  the  models” 
(page  22).  It  is  necessary  in  a  logistic  regression  model  to  show  the  effects  of  the 
separation  codes  in  explaining  the  relationship  on  attrition.  The  approximately  126,000 
missing  separation  codes  not  only  represent  another  weakness  of  the  thesis,  but  also 
represent  a  weakness  on  model  selection. 

The  application  of  a  logistic  regression  model  to  forecast  NEAS  attrition  is  valid, 
but  the  large  amount  of  discrepancies  in  the  data  degrades  the  validity  of  this  thesis. 
Utilizing  known  independent  variables  in  a  proven  logistic  regression  model  to  explain  a 
binary  dependent  variable  is  a  sound  research  methodology  for  this  type  of  manpower 
research.  The  Receiving  Operator  Characteristics  (ROC)  curves  analysis  shows  the 
logistic  regression  models  performs  well.  This  type  of  regression  analysis  is  not  feasible 
to  use  on  a  regular  basis  in  the  manpower  planning  process.  This  thesis  shows  that  these 
types  of  regression  models  continue  to  provide  insight  into  attrition  behavior,  although 
historical  data  inaccuracies  are  still  the  greatest  challenge  to  the  correct  application  of 
these  models.  The  author  recommends  that  further  research  in  this  area  should  be 
preformed  utilizing  survival  analysis,  by  month  and  by  MOS,  which  is  the  premise  of  the 
following  thesis  by  Hall. 

D.  HALL  (2009) 

This  NPS  thesis  from  2009  applies  parametric  modeling  techniques  to  forecast 
enlisted  attrition.  The  author  includes  those  characteristics  that  influence  attrition 
behavior  in  the  model  and  combines  them  into  one  forecasting  model.  The  thesis 
analyzes  enlisted  Marines  entering  the  service  until  becoming  a  NEAS  loss  or  exiting  the 
service  as  an  EAS  loss.  Hall  uses  personal  and  professional  demographic  characteristics, 
similar  to  the  Orrick’s  thesis,  to  detennine  if  the  characteristics  can  forecast  future 
attrition  behavior.  The  findings  of  the  thesis  are  “that  the  use  of  survival  analysis  could 
be  beneficial  to  not  only  forecast  attrition,  but  also  provide  a  descriptive  assessment  of 
attrition  rates  amongst  occupation  fields  without  loss  of  information  due  to  averaging  or 
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weighting  probabilities”  (v).  The  purpose  of  including  this  thesis  is  to  analyze  a  survival 
analysis  technique  in  order  to  identify  the  strengths  and  weaknesses  of  this  type  of 
analysis. 

The  strength  of  the  thesis  is  a  thorough  data  collection  process  and  a 
methodological  approach  to  its  analysis.  The  master  data  set  comprises  of  25  individual 
data  sets  containing  all  enlisted  Marines  who  entered  in  the  Marine  Corps  between  1 
January  1996  and  31  October  2008.  The  data  sets  capture  all  accessions  per  month  and 
verify  the  continuation  of  service  of  those  Marines  who  accessed  in  previous  FY’s.  The 
master  data  set  includes  a  “Personal  Statistic”  data  set  for  each  FY  to  accompany  the 
accession  data  that  provide  updated  information  of  each  Marine’s  personal  and 
professional  characteristics  as  they  changed  over  time.  A  final  “Separation”  data  set  is  in 
the  master  data  set  to  collect  all  separations  per  FY.  Lastly,  all  observations  in  each  of 
these  three  data  sets  are  collapsed  into  one  observation  per  Marine,  capturing  the  entire 
length  of  service  in  the  master  data  set.  The  initial  master  data  set  contained  419,893 
individual  observations,  but  39,562  were  dropped  due  to  data  inaccuracies  with 
separation  codes  and  gender.  The  final  master  data  set  consists  of  376,710  observations, 
but  3,063  duplicate  entries  are  not  used  in  the  analysis  because  of  the  unreliability  of  the 
data  with  Marines  in  a  “Deserter  Status.”  The  master  data  set  contains  88  personal  and 
professional  demographic  variables,  although  not  all  88  variables  are  used  in  estimations 
in  the  thesis. 

An  additional  strength  of  the  thesis  is  a  thorough  model  selection  process.  The 
author  estimates  the  data  without  covariates,  progresses  to  a  model  with  covariates  and 
concludes  with  a  test  on  the  specific  influences  of  the  covariates  have  on  the  hazard  rate. 
The  hypothesis  is  “that  transition  rates  (hazard  rates)  will  decline  at  a  monotonic  rate  as 
time  increases”  (37).  The  Gompertz  model  without  covariates  shows  proof  that  enlisted 
transition  rates  decrease  as  enlistment  time  increases,  supporting  the  hypothesis,  but  those 
results  did  not  include  the  other  explanatory  variables  that  could  influence  transition 
rates.  Including  56  parameters,  the  Gompertz  model  with  covariates  provides  a  better  log 
likelihood  value  than  the  model  without  covariates.  This  fact  supports  the  author’s 
hypothesis  and  provides  a  better  description  of  the  hazard  rate. 
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Similar  to  the  previous  thesis,  a  weakness  of  this  study  is  that  the  findings  are 
only  as  good  as  the  data  collected.  Both  studies  show  that  the  data  from  the  TFDW  is 
unreliable  in  the  collecting  of  any  number  of  explanatory  variables,  especially  separation 
codes.  Until  the  process  of  collecting  and  archiving  data  within  the  Marine  Corps  is 
improved,  it  is  important  to  use  explanatory  variables  that  are  more  reliable.  Examples  of 
this  are  grade  and  YOS,  which  are  easily  extracted  from  TFDW  and  quickly  calculated 
with  analysis  software.  Both  studies  provide  evidence  that  there  are  numerous 
explanatory  variables  than  help  explain  attrition  behavior,  but  there  are  still  unobservable 
characteristics  of  each  Marine  not  accounted  for  in  the  data,  which  contribute  to  attrition. 
Another  weakness  of  this  thesis  is  that  survival  analysis  is  not  feasible  to  conduct  by 
manpower  planners  in  any  regular  interval.  Simple,  efficient  and  flexible  modeling 
techniques  are  required  in  the  manpower  planning  process. 

The  findings  of  this  study  provide  further  evidence  of  many  of  the  same  insights 
that  manpower  planners  understand  as  common  characteristics  of  attrition  behavior.  For 
example,  the  longer  a  Marine  remains  on  active  duty  then  the  less  likely  that  individual  is 
to  attrite,  females  have  higher  attrition  rates  than  males,  married  Marines  are  less  likely  to 
attrite  than  single  Marines  and  certain  MOS’s  have  higher  attrition  than  others  MOS’s. 
Although  both  Gompertz  models  support  the  hypothesis  and  the  model  with  covariates  is 
more  descriptive  in  its  results,  the  findings  of  the  thesis  provide  nothing  significant  to 
improve  on  the  attrition  forecasting  methods  currently  being  used  by  manpower  planners. 
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III.  DATA 


A.  INTRODUCTION 

This  chapter  focuses  on  the  data  collection,  analysis  software,  and  methodology 
behind  the  calculation  of  historic  enlisted  attrition  rates  by  grade  and  YOS  in  the  Marines 
Corps.  This  chapter  also  discusses  the  models  used  to  forecast  historic  attrition  rates  by 
grade  and  YOS.  The  purpose  of  this  chapter  is  to  provide  a  thorough  understanding  of 
the  data  analysis  process  and  assures  validity  of  the  findings  in  the  following  chapters  of 
this  thesis. 

B.  COLLECTION  AND  SUMMARY  STATISTICS 

This  section  summarizes  the  data  collection  process,  the  variables  of  each 
observation,  and  the  final  statistics.  The  data  was  extracted  from  the  TFDW  in  23 
separate  data  sets.  Each  data  set  is  a  snapshot  of  enlisted  end  strength  on  30  September 
of  each  FY  beginning  in  1987  and  ending  in  2009.  Each  observation  contains  five 
variables;  the  sequence  number,  Social  Security  Number  (SSN),  Anned  Forces  Active 
Duty  Base  Date  (AFADBD),  present  grade  code  and  PMOS  code.  The  sequence  number 
is  used  to  identify  each  30  September  per  FY.  The  SSN  is  used  to  locate  each  individual 
Marine  and  verification  whether  or  not  that  particular  Marine  is  still  on  active  duty  in  the 
in  the  following  FY.  The  AFADBD  is  used  to  calculate  each  Marine’s  YOS  total  at  the 
end  of  each  FY.  YOS  is  calculated  by  actual  years  completed.  For  example,  a  Marine 
with  0  YOS  has  not  yet  completed  one  YOS  and  is  any  enlisted  Marine  with  less  than  12 
months  on  active  duty  since  their  AFADBD.  The  present  grade  code  is  used  to  identify 
what  each  Marine’s  current  grade  is  on  30  September  of  each  FY.  The  PMOS  code  is  not 
used  in  this  study.  The  total  number  of  observations  of  all  23  data  sets  is  3,778,491. 
Analyzing  the  results  using  SAS  and  applying  the  current  service  limits  set  forth  in  the 
ECFC,  the  final  number  of  observations  is  3,578,157.  Table  1  displays  the  grade  and 
YOS  combinations  used  for  this  study. 
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Grade 

YOS  (Min) 

YOS  (Max) 

El 

0 

5 

E2 

0 

5 

E3 

0 

5 

E4 

1 

8 

E5 

2 

13 

E6 

5 

20 

E7 

9 

22 

ES 

14 

27 

E9 

19 

30 

Table  1.  Grade  and  YOS  Combinations 


C.  ANALYSIS  SOFTWARE 

The  primary  means  to  manipulate  the  raw  data  extracted  from  TFDW,  calculate 
historic  enlisted  attrition  by  grade  and  YOS  combinations,  and  apply  time  series 
forecasting  models  to  predict  those  historic  rates  was  done  using  the  SAS  System  for 
Windows  V8.  Microsoft  Excel  is  also  utilized  to  calculate  the  actual  attrition  rates  by 
grade  and  YOS  once  the  raw  data  was  transformed  into  a  usable  format  in  SAS.  The  R 
software  environment  is  used  to  calculate  statistical  significance  of  the  results. 

D.  METHODOLOGY  TO  CALCULATE  ATTRITION  BY  GRADE  AND 
YEARS  OF  SERVICE 

1 .  The  first  step  in  calculating  enlisted  attrition  in  the  Marine  Corps  by  grade 
and  YOS  is  to  import  the  23  separate  data  sets  extracted  from  TFDW  into 
SAS  v8  for  Windows.  The  file  extension  used  to  save  the  data  sets  was  in 
Comma  Separated  Values  (CSV)  format. 

2.  Before  beginning  any  calculations,  the  characters  representing  a  date  in 
history  were  changed  into  recognizable  date  fonnats  in  SAS.  Most 
importantly,  the  sequence  numbers  and  AFADBD  were  changed  to  SAS 
date  elements. 

3.  The  next  step  is  to  calculate  YOS  for  each  observation  by  subtracting  the 
sequence  number  date  from  the  AFADBD  for  each  data  set.  This 
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measurement  of  time  is  in  days  and  is  programmed  in  SAS  to  represent 
cumulative  YOS  by  each  additional  twelve  months  of  service  on  active 
duty. 

4.  Each  data  set  is  sorted  by  identification  number  and  merged  together.  The 
below  constraints  are  required  to  standardize  the  results. 

a.  Drop  observations  if  YOS  is  greater  than  30. 

b.  Drop  observations  if  YOS  is  less  than  zero. 

c.  Drop  observations  if  AFADBD  is  blank. 

5.  The  23  data  sets  were  merged  consecutively  by  year.  The  beginning 
balance  of  enlisted  personnel  is  the  final  end  strength  on  30  September 
1987.  Each  following  FY,  observations  are  identified  to  continue  on 
active  duty  or  to  have  left  active  duty.  Those  observations  that  were  not  in 
the  following  FY’s  data  are  considered  attrition.  This  annual  continue  and 
attrite  information  was  collected  by  FY  totals  using  SAS.  This  merged 
data  is  sorted  by  Present  Grade  and  YOS. 

6.  This  infonnation  is  exported  into  two  excel  files  that  contain  the  total  end 
strength  data  per  FY  in  Appendix  A  and  total  attrition  data  per  FY  by  each 
grade  and  YOS  combination  in  Appendix  B.  The  ECFC  service  limits  are 
applied  and  the  observations  outside  the  constraints  are  dropped. 

7.  Lastly,  the  annual  attrition  rate  is  calculated  for  each  grade  and  YOS 
combination  in  Appendix  C.  This  is  done  by  dividing  the  total  attrition 
number  for  each  FY  by  the  total  end  strength  of  the  previous  FY. 

E.  FORECASTING  MODELS 

The  initial  forecasting  technique  used  in  this  thesis  is  a  simple  MA  model.  This 
model  utilizes  the  historic  attrition  rates  calculated  between  FY87-08  in  one-  to  five-year 
estimation  models.  As  stated  by  Ragsdale  (2001),  “the  predicted  value  of  the  time  series 
in  period  t  +  1  is  simply  the  average  of  the  k  previous  observations  in  the  series”  (491). 
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Ragsdale  further  elaborates  that  there  is  no  general  value  of  k  that  is  best  suited  for  a 
particular  time  series,  thus  multiple  values  of  k  should  be  compared  in  order  to  develop 
the  best  forecast.  As  the  simplest  form  of  forecasting,  the  MA  is  calculated  in  this  thesis 
as  a  baseline  model  for  comparison  of  the  WMA  models.  Below  is  the  equation  for  the 
calculation  of  the  MA  model.  This  equation  and  all  other  equations  used  in  this  chapter 
are  from  Ragsdale’s  textbook.  For  each  grade  i,  and  each  YOS  j,  the  calculation  of  the  k- 
Year  MA  model  is: 


Yi,  j,  t  +  Yi,  j,t  -  1  +  Yi,  j,t  -  k  +  1 

k 


where  i  and  j  are  the  grade  and  YOS  combinations  described  in  Table  1. 

One  disadvantage  of  the  MA  models  is  that  the  values  of  older  data  points  can  have 
disproportionate  effects  on  the  results.  This  is  possible  in  the  case  of  attrition  in  the 
military  because  during  different  periods  in  history  result  in  significant  increases  or 
decreases  in  military  manpower  attrition  from  one  year  to  the  next  because  of 
congressionally  mandated  end  strength  requirements  that  fluctuate  within  the  political 
and  budgetary  environment  within  the  federal  government. 

The  next  forecasting  technique  used  in  this  thesis  is  a  WMA  model.  This  model 
utilizes  the  historic  attrition  rates  calculated  between  FY87-08  in  one-  to  five-year 
model  estimations.  Due  to  the  possible  disproportionate  effects  on  the  results  due  to  the 
older  data  points  of  the  MA  model,  the  WMA  models  allows  for  the  manipulation  of  the 
relative  importance  of  previous  data  points.  Most  WMA  models  weight  the  most  recent 
data  points  more  heavily  and  decrease  the  weights  of  the  preceding  time  periods. 
Ragsdale  notes,  “Although  the  weighted  moving  average  offers  greater  flexibility  than 
the  moving  average,  it  is  also  a  bit  more  complicated”  (495).  For  each  grade  i,  and  each 
YOS  j,  the  calculation  of  the  following  k-  year  WMA  model  is: 


Yi,  j,t  +  1  —  WiYi,  j,  t  +  WiYi,  j,t  -  1  +  . . .  +  WkYi,  j,t  -  k  +  1 


where, 
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As  before,  i  and  j  are  the  grade  and  YOS  combinations  described  in  Table  1.  The 
increased  complication  of  the  WMA  formula  is  that  the  values  for  k  must  be  detennined, 
but  also  the  values  of  each  weight  must  also  be  calculated.  The  relationship  of  each  w  is 
that  the  largest  weight  value  (w\)  starts  with  the  most  recent  data  point  (Y,)  and  the 
subsequent  weights  (w2...Wk)  decrease  in  value  in  concert  with  the  older  data  points  (Yt_i, 
...Y t-k+i).  The  summation  of  the  weights  in  the  formula  must  equal  one.  By  utilizing 
Solver  in  the  Microsoft  excel  software  program,  it  is  possible  to  determine  those  optimal 
values  of  the  weights  that  minimize  the  error  values. 

The  accuracy  of  the  forecasts  will  be  measured  against  the  actual  historic  values 
previously  calculated.  This  study  uses  the  following  Measures  of  Effectiveness  (MOE). 

2 

,  j 


MSE  =  y 


0 Yij-Yi 


MAPE  = 


100 

n 


I 


Yi, 


Where,  i  and  j  are  the  grade  and  YOS  combinations  described  in  Table  1.  For  a  particular 
model  and  a  given  FY,  MSE  is  the  squared  error  of  each  grade  and  YOS  estimate 
averaged  over  all  such  estimates  for  that  FY.  Due  to  the  unique  distribution  of  enlisted 
Marines,  with  the  overwhelming  majority  of  the  force  in  the  E-l  through  E-5  pay  grades, 
and  the  E-6  through  E-9  pay  grades  representing  a  small  minority  of  the  force,  a  pyramid 
force  structure  is  observed.  The  densely  populated  bottom  and  sparsely  populated  top  of 
the  pyramid  have  different  effects  on  the  MSE  and  MAPE  measurements  of  accuracy. 
As  a  result,  MSE  is  the  error  measurement  to  utilize  in  the  manpower  planning  process  if 
it  is  more  important  to  be  accurate  in  the  aggregate.  MSE  will  tend  to  select  the  models 
that  most  accurately  describe  the  most  densely  populated  grade  and  YOS  combinations, 
which  are  the  E-l  through  E-5  grade  and  YOS  combinations  calculated  in  this  study. 


On  the  other  hand,  observing  the  pyramid  shape  of  the  enlisted  force  structure, 
MAPE  makes  accuracy  in  predicting  all  grade  and  YOS  combinations  equally  important. 

As  a  result,  MAPE  will  tend  to  select  models  that  explain  all  grade  and  YOS 
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combinations  equally,  which  puts  extra  emphasis  on  getting  those  sparsely  populated 
grade  and  YOS  combinations  at  the  top  of  the  pyramid  correct.  The  benefit  of  utilizing 
MAPE  in  measuring  the  accuracy  of  time  series  forecasts  is  that  regardless  of  the 
difference  in  values,  these  differences  are  translated  to  a  percent  of  total  observations  of 
the  particular  grade  and  YOS  combination. 
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IV.  RESULTS 


A.  GENERAL  RESULTS  FOR  EACH  MODEL 

Applying  the  MA  and  WMA  model  to  the  historic  enlisted  attrition,  the 
calculation  of  forecasted  attrition  numbers  for  FY88-08  was  made  for  each  grade  and 
YOS  combination  in  Table  1.  This  process  was  completed  for  five  (one-  to  five-year 
models)  forecasts,  which  resulted  in  1,953  forecasts  per  each  year  category.  The  1,953 
forecasts  represent  92  total  grade  and  YOS  combinations  multiplied  by  21  years  of  data. 
Then  the  MAPE  and  MSE  were  calculated  for  FY88-07  forecasts,  which  resulted  in 
another  (5  x  1,860)  9,300  error  calculations.  The  reason  for  the  465  difference  in  the 
number  of  error  results  is  because  the  historic  attrition  from  FY09  is  required  in  order  to 
calculate  the  error  for  the  FY08  forecast,  but  this  thesis  did  not  calculate  the  attrition  from 
FY09.  The  perfonnance  of  each  of  the  models  is  compared  for  each  year.  Each  of  these 
data  points  are  ranked  on  a  scale  from  1-5,  based  on  the  value  of  each  FY  to  denote  the 
lowest  to  highest  error  value.  The  following  sections  of  this  chapter  discuss  the  specific 
results  of  each  model  and  error  calculation. 

B.  MOVING  AVERAGE  MODEL  (MSE) 

The  FY  average  error  values  are  in  Table  2,  and  the  plotted  data  points  are  in 
Figure  2.  Table  2  reveals  the  average  MSE  results  range  from  a  low  of  1,542  in  the  five- 
year  MA  model  in  FY05  to  a  high  of  56,473  in  the  four-year  MA  model  in  FY07. 


FY  88 

FY  89 

FY  90 

FY  91 

FY  92 

FY  93 

FY  94 

FY  95 

FY  96 

FY  97 

FY  98 

FY  99 

FY  00 

FY01 

FY  02 

FY  03 

FY  04 

FY  05 

FY  06 

FY  07 

1  YR 

5536 

6661 

10108 

43500 

7537 

3423 

3086 

3160 

2409 

5925 

4367 

12772 

3669 

2854 

2587 

3268 

2370 

2168 

6378 

37679 

2  YR 

11138 

15391 

43936 

21804 

2952 

3269 

2711 

4009 

7691 

8630 

10200 

13078 

5931 

4215 

3591 

1808 

1660 

4278 

51942 

3  YR 

22714 

39254 

31855 

7674 

3559 

3547 

4438 

9554 

12214 

8372 

15463 

14542 

7087 

4421 

2210 

1713 

4687 

52316 

4  YR 

35648 

35163 

11955 

5285 

3116 

5570 

11001 

15133 

7454 

14589 

19485 

14072 

5632 

2652 

1708 

3851 

56473 

5  YR 

35200 

13613 

8907 

4527 

5805 

12965 

17608 

7957 

13010 

20597 

20101 

8573 

3766 

1542 

3700 

56016 

Table  2.  Average  MSE  Results  by  Fiscal  Year 
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The  plotted  data  points  in  Figure  2  reveal  a  similar  trend  in  all  the  MSE  results 
except  for  an  increase  in  the  one -year  and  two-year  MSE  values  in  FY99.  The  remaining 
three  estimations  during  FY99  all  decrease  in  average  MSE.  The  spike  in  MSE  in  the 
earlyl990’s  period  is  due  to  the  total  number  of  attrition  of  enlisted  Marines  during  that 
period  is  significantly  different  from  the  surrounding  years  due  to  the  build  up  and 
execution  of  Operation  Desert  Storm/Shield  and  the  corresponding  release  of  troops  in  its 
aftermath.  The  number  of  attrition  of  enlisted  Marines  in  FY88-91  averaged  30,000,  but 
in  FY92-93  the  average  increased  to  34,000  and  returned  to  the  30,000  average  until 
FY03. 

The  spike  in  MSE  in  FY07  period  is  due  to  the  decrease  in  enlisted  attrition  in 
FY05-08.  The  average  dropped  to  around  26,500  during  the  this  period  because  the 
Marine  Corps  was  increasing  end  strength  from  around  160,000  in  FY05  to  nearly 
178,000  enlisted  Marines  in  FY08. 


MA  MSE  RESULTS 
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Figure  2.  Moving  Average  MSE  Results 

The  rankings  in  Table  3  (less  FY88-91),  reveal  that  the  one-year  MA  MSE  is 
lowest  during  thirteen  years,  the  two-year  MA  MSE  is  lowest  during  three  years,  the 
three-year  MA  MSE  is  never  the  lowest,  the  four-year  MA  MSE  is  lowest  during  one 
year,  and  the  five-year  MA  MSE  is  lowest  during  two  years. 
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FY  88 

FY  89 

FY  90 

FY  91 

FY  92 

FY  93 

FY  94 

FY  95 

FY  96 

FY  97 

FY  98 

FY  99 

FY  00 

FY  01 

FY  02 

FY  03 

FY  04 

FY  05 

FY  06 

FY  07 

1  YR 

1 

1 

1 

3 

1 

2 

1 

3 

1 

1 

1 

5 

1 

1 

1 

1 

3 

5 

5 

1 

2  YR 

2 

2 

4 

2 

1 

2 

1 

2 

2 

2 

4 

3 

2 

2 

2 

1 

2 

3 

2 

3  YR 

3 

2 

3 

3 

3 

4 

3 

3 

3 

3 

5 

3 

3 

3 

2 

4 

4 

3 

4  YR 

1 

4 

4 

4 

2 

4 

4 

4 

1 

4 

4 

4 

4 

4 

3 

2 

5 

5  YR 

5 

5 

5 

5 

5 

5 

5 

2 

2 

5 

5 

5 

5 

1 

1 

4 

Table  3.  Moving  Average  MSE  Rankings 


The  Friedman  test  is  used  to  detennine  the  statistical  significance  of  the  results.  The 
hypotheses  of  the  Friedman  Test  are: 

Ho:  Each  ranking  of  the  random  variables  within  a  block  is  equally  likely  (i.e.,  the 
treatments  have  identical  effects). 

Hi:  At  least  one  of  the  treatments  tends  to  yield  larger  observed  values  than  at 

least  one  other  treatment. 

The  test  of  the  null  hypothesis  that  there  is  no  difference  in  the  effectiveness  of 
any  of  these  models  suggests  sufficient  evidence  exists  to  reject  the  null  hypothesis  (p- 
value  .003).  The  post  hoc  multiple  comparison  of  the  five  models  is  shown  in  Table  4. 


Model 

1-Year 

A 

2-Year 

A 

3-Year 

A 

B 

4-Year 

B 

5-Year 

B 

‘Levels  not  connected  by  same  letter  are  significantly  different 

Table  4.  Post  Hoc  Multiple  Comparison  of  Moving  Average  (MSE)  Models 

The  one-year  and  two-year  models  are  significantly  different  and  better  in 
comparison  to  the  four  and  five-year  models.  The  three-year  model  is  not  significantly 
different  from  the  four  other  year  models. 
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c. 


MOVING  AVERAGE  MODEL  (MAPE) 


The  average  error  values  for  each  FY  are  in  Table  5  and  the  plotted  data  points  are 
in  Figure  3.  Table  5  reveals  the  average  MAPE  results  range  from  a  low  of  0.149  in  the 
three-year  MA  model  in  FY98  to  a  high  of  0.327  in  the  five-year  MA  model  in  FY96. 


FY  88 

FY  89 

FY  90 

FY  91 

FY  92 

FY  93 

FY  94 

FY  95 

FY  96 

FY  97 

FY  98 

FY  99 

FY  00 

FY01 

FY  02 

FY  03 

FY  04 

FY  05 

FY  06 

FY  07 

1  YR 

0.213 

0.286 

0.264 

0.262 

0.259 

0.289 

0.265 

0.231 

0.242 

0.215 

0.158 

0.242 

0.194 

0.197 

0.225 

0.228 

0.210 

0.262 

0.286 

0.230 

2  YR 

0.261 

0.262 

0.269 

0.262 

0.266 

0.295 

0.277 

0.253 

0.179 

0.157 

0.212 

0.206 

0.209 

0.242 

0.207 

0.208 

0.230 

0.227 

0.201 

3  YR 

0.270 

0.251 

0.262 

0.253 

0.284 

0.303 

0.275 

0.190 

0.149 

0.209 

0.227 

0.239 

0.257 

0.208 

0.210 

0.194 

0.213 

0.208 

4  YR 

0.251 

0.275 

0.235 

0.251 

0.311 

0.320 

0.226 

0.172 

0.196 

0.238 

0.238 

0.293 

0.222 

0.208 

0.194 

0.226 

0.205 

5  YR 

0.273 

0.247 

0.240 

0.284 

0.327 

0.273 

0.211 

0.204 

0.242 

0.249 

0.313 

0.246 

0.210 

0.196 

0.232 

0.184 

Table  5.  Average  MAPE  Results  by  Fiscal  Year 


The  plotted  data  points  in  Figure  3  reveal  a  similar  trend  in  all  the  MAPE  results 
except  for  an  increase  in  the  MAPE  of  the  one-year  and  two-year  estimates  in  FY99  then 
a  decrease  in  FY00.  The  remaining  three  years’  average  MAPE  results  increase  during 
both  FY99  and  FY00. 
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Figure  3.  Moving  Average  MAPE  Results 


MAMAPE  RESULTS 
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-1  YK 
-2YR 
-3  YR 
-4YR 
-5  YR 


The  MAPE  rankings  in  Table  6  (less  FY88-91),  reveal  that  the  one-year  MA 
MAPE  is  lowest  during  six  years,  the  two-year  MA  MAPE  is  lowest  during  three  years, 
the  three-year  MA  MAPE  is  the  lowest  during  two  years,  the  four-year  MA  MAPE  is 
lowest  during  three  years,  and  the  five-year  MA  MAPE  is  lowest  during  two  years. 
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FY  88 

FY  89 
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FY  91 

FY  92 

FY  93 
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FY  95 
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FY  97 

FY  98 
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FY  02 

FY  03 

FY  04 

FY  05 

FY  06 

FY  07 
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1 
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3 
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3 
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1 
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1 

2 

2 

5  YR 

4 

2 

1 

3 

5 

5 

5 

2 

5 

5 

5 

5 

5 

3 

4 

1 

Table  6.  Moving  Average  MAPE  Rankings 


A  Friedman  Test  of  the  null  hypothesis  that  there  is  no  difference  in  the  effectiveness  of 
any  of  these  models  reveals  insufficient  evidence  exists  to  reject  this  hypothesis  (p-value 
0.2). 

D.  WEIGHTED  MOVING  AVERAGE  MODEL  (MSE) 

The  Solver  add-in  for  Microsoft  Excel  applies  the  equation  introduced  in  Chapter 
III  and  selects  the  values  for  each  weight  (wn)  that  minimizes  the  MSE  or  MAPE  for  that 
model,  over  the  course  of  all  years  in  the  dataset.  In  this  case,  the  optimal  weights 
calculated  using  Solver,  give  the  majority  of  the  weight  to  the  year  closest  to  the  current 
year.  The  optimal  weights  in  Table  7  state  that  essentially  all  WMA  MSE  models  are 
best  estimated  as  essentially  a  one -year  MA  model. 


Weight  1 

Weight  2 

Weight  3 

Weight  4 

Weight  5 

2-Year 

1 

0 

3-Year 

1 

0 

0 

4-Year 

0.9542 

0 

0 

0.0458 

5-Year 

0.9971 

0 

0 

0 

0.0029 

Table  7.  Optimal  MSE  Weights  Calculated  Using  Solver 


The  FY  average  error  values  are  in  Table  8  and  the  plotted  data  points  are  in  Figure  4. 
Table  8  reveals  the  average  MSE  results  range  from  a  low  of  2,036  in  the  four-year 
WMA  model  in  FY05  to  a  high  of  43,500  in  the  one -year  through  three-year  WMA 
model  in  FY91. 
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FY  88 

FY  89 

FY  90 

FY  91 

FY  92 

FY  93 

FY  94 

FY  95 

FY  96 

FY  97 

FY  98 

FY  99 

FY  00 

FY  01 

FY  02 

FY  03 

FY  04 

FY  05 

FY  06 

FY  07 

1  YR 

5536 

6661 

10108 

43500 

7537 

3423 

3086 

3160 

2409 

5925 

4367 

12772 

3669 

2854 

2587 

3268 

2370 

2168 

6378 

37679 

2  YR 

6661 

10108 

43500 

7537 

3423 

3086 

3160 

2409 

5925 

4367 

12772 

3669 

2854 

2587 

3268 

2370 

2168 

6378 

37679 

3  YR 

10108 

43500 

7537 

3423 

3086 

3160 

2409 

5925 

4367 

12772 

3669 

2854 

2587 

3268 

2370 

2168 

6378 

37679 

4  YR 

41297 

8236 

3006 

2904 

3009 

2497 

6202 

4800 

11710 

3807 

3485 

3014 

3152 

2232 

2036 

6092 

38920 

5  YR 

7560 

3390 

3087 

3153 

2415 

5949 

4397 

12701 

3662 

2879 

2616 

3238 

2363 

2153 

6364 

37723 

Table  8.  Average  MSE  Results  by  Fiscal  Year 


The  plotted  data  points  in  Figure  4  reveal  a  nearly  identical  trend  in  all  the  year  MSE 
results.  The  majority  of  the  years  have  the  exact  same  average  MSE  results  and  the 
biggest  difference  in  MSE  values  is  2,203  in  FY91  in  the  four-year  WMA  model  in 
comparison  to  the  three  other  models’  MSE  results  that  year. 
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Figure  4.  Weighted  Moving  Average  MSE  Results 


The  MSE  rankings  in  Table  9  (less  FY88-91),  reveal  that  the  one-year  through 
three-year  WMA  MSE  is  lowest  during  seven  years,  the  four-year  WMA  MSE  is  lowest 
during  eight  years,  and  the  five-year  WMA  MSE  is  lowest  during  one  year. 
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Table  9.  Weighted  Moving  Average  MSE  Rankings 


A  Friedman  Test  of  the  null  hypothesis  that  there  is  no  difference  in  the  effectiveness  of 
any  of  these  models  reveals  insufficient  evidence  exists  to  reject  this  hypothesis  (p-value 
1.0). 

E.  WEIGHTED  MOVING  AVERAGE  MODEL  (MAPE) 

The  optimal  weights  in  Table  10  calculated  using  Solver,  give  over  65%  of  the 
weight  to  the  year  closest  to  the  current  year  except  for  in  the  two-year  WMA  model. 


Weight  1 

Weight  2 

Weight  3 

Weight  4 

Weight  5 

2-Year 

0.3044 

0.6956 

3-Year 

0.6711 

0.1985 

0.1305 

4-Year 

0.6628 

0.1937 

0.1187 

0.0248 

5-Year 

0.6637 

0.2078 

0.0965 

0.0320 

0 

Table  10.  Optimal  Weights  Calculated  Using  Solver 

The  FY  average  error  values  are  in  Table  1 1  and  the  plotted  data  points  are  in  Figure  5. 
Table  1 1  reveals  the  average  MAPE  results  range  from  a  low  of  .149  in  the  three  WMA 


model  in  FY98  to  a  high  of  .289  in  the  one -year  WMA  model  in  FY93. 


FY  88 

FY  89 

FY  90 

FY  91 

FY  92 

FY  93 

FY  94 

FY  95 

FY  96 

FY  97 

FY  98 

FY  99 

FY  00 

FY01 

FY  02 

FY  03 

FY  04 

FY  05 

FY  06 

FY  07 

1  YR 

0.213 

0.286 

0.264 

0.262 

0.259 

0.289 

0.265 

0.231 

0.242 

0.215 

0.158 

0.242 

0.194 

0.197 

0.225 

0.228 

0.210 

0.262 

0.286 

0.230 

2  YR 

0.268 

0.252 

0.260 

0.253 

0.252 

0.277 

0.252 

0.244 

0.189 

0.153 

0.221 

0.196 

0.198 

0.230 

0.211 

0.205 

0.239 

0.247 

0.196 

3  YR 

0.254 

0.250 

0.251 

0.238 

0.265 

0.254 

0.243 

0.190 

0.149 

0.222 

0.202 

0.208 

0.231 

0.210 

0.205 

0.226 

0.245 

0.197 

4  YR 

0.250 

0.252 

0.235 

0.260 

0.255 

0.247 

0.192 

0.150 

0.221 

0.202 

0.206 

0.234 

0.209 

0.204 

0.225 

0.245 

0.196 

5  YR 

0.253 

0.237 

0.261 

0.255 

0.247 

0.192 

0.150 

0.220 

0.201 

0.204 

0.234 

0.209 

0.204 

0.226 

0.245 

0.195 

Table  1 1 .  Average  MAPE  Results  by  Fiscal  Year 

The  plotted  data  points  in  Figure  5  reveal  a  similar  trend  in  all  the  results  of  the 
MAPE  except  for  an  increase  in  the  MAPE  of  the  one -year  model  in  FY93  when  all  other 
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MAPE  results  decrease  during  that  year.  The  one-year  MAPE  results  also  decreases  in 
FY95  and  increase  in  FY96,  when  all  other  model  MAPE  results  decrease  in  FY95  and  in 
FY96. 
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Figure  5.  Weighted  Moving  Average  MAPE  Results 


The  MAPE  rankings  in  Table  12  (less  FY88-91),  reveal  that  the  one-year  WMA 
MAPE  is  lowest  during  five  years,  the  two-year  WMA  MAPE  is  lowest  during  one  year, 
the  three-year  WMA  MAPE  is  the  lowest  during  two  years,  the  four-year  WMA  MAPE 
is  lowest  during  four  years,  and  the  five-year  WMA  MAPE  is  lowest  during  four  years. 
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Table  12.  Weighted  Moving  Average  MAPE  Rankings 
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V.  CONCLUSION 


A.  SUMMARY 

The  purpose  of  this  thesis  is  to  analyze  historical  USMC  enlisted  attrition 
behavior  and  apply  time  series  forecasting  techniques  by  grade  and  YOS  in  order  to 
identify  methods  to  improve  manpower  analysts’  ability  to  effectively  forecast  attrition 
behavior.  The  application  of  time  series  forecasting  techniques  to  analyze  historical 
enlisted  end  strength  data  by  grade  and  YOS  provides  sufficient  evidence  that  in  most 
instances  a  one-year  MA  model  is  superior  to  that  of  the  two-  to  five-year  MA  and  one- 
to  five-year  WMA  models.  Depending  on  the  goal  of  manpower  analysts  forecasting 
attrition  by  grade  and  YOS,  the  MSE  and  MAPE  MOE’s  of  the  forecasts  are 
interchangeable. 

B.  RECOMMENDATIONS 

1.  Research  Question  One 

Of  the  techniques  most  accessible  to  manpower  analysts,  which  of  these  best 
forecast  enlisted  attrition  behavior  in  the  Marine  Corps  by  grade  and  YOS? 

Analysis  of  the  MA  and  WMA  time  series  forecasting  techniques  and  application 
of  one-  to  five-year  estimation  models  provide  sufficient  evidence  that  a  one-year  MA 
model  is  the  best  technique  to  utilize  when  forecasting  attrition  by  grade  and  YOS.  In 
fact,  the  optimal  weights  for  the  WMA  models  are  equivalent  to  a  one-year  MA  model. 

This  fact  is  important  to  understand  in  the  field  of  manpower  analysis.  In  the 
complex  and  rapidly  changing  environment  of  manpower  analysis,  time  is  a  precious 
commodity  that  must  be  rationed  appropriately  among  competing  requirements.  More 
importantly,  the  ability  to  rapidly  estimate  accurate  attrition  forecasts  by  grade  and  YOS 
allows  manpower  analysts  to  gain  time  to  focus  their  efforts  on  other  key  responsibilities. 
Forecasting  enlisted  attrition  by  grade  and  YOS  is  a  simple  and  flexible  method  for 

manpower  analysts  to  utilize.  The  grade  and  AFADBD  variables  are  reliable  when 
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extracted  from  historical  database  archives  such  as  the  TFDW.  The  calculation  of  YOS 
from  the  AFADBD  is  simple  to  execute  in  the  SAS  program  currently  used  by  manpower 
analysts. 

Based  on  this  analysis,  the  recommendation  to  MPP-20  and  MPP-50  is  that  the 
use  of  a  one-year  MA  forecasting  technique  is  the  most  effective  way  to  estimate  enlisted 
attrition  rates  in  the  Marine  Corps  by  grade  and  YOS  in  comparison  to  the  other  models 
used  in  this  study. 

2.  Research  Question  Two 

How  does  the  choice  of  technique  depend  on  the  measure  of  effectiveness? 

This  study  found  statistical  significance  only  in  the  MA  models  using  MSE,  but 
not  using  MAPE.  The  one-  and  two-year  models  are  significantly  different  and  better  in 
comparison  to  the  four-  and  five-year  models.  The  three-year  model  is  not  significantly 
different  from  the  four  other  year  models  using  MSE. 

In  contrast,  the  WMA  models  have  no  practical  or  statistical  significant  difference 
using  MSE  or  MAPE.  Further  analysis  of  the  MSE  and  MAPE  measurements  of 
accuracy  of  the  estimates  provide  evidence  that  either  MSE  or  MAPE  are  appropriate 
MOE’s  depending  on  the  density  of  the  population  of  interest. 

Based  on  this  analysis,  the  recommendation  to  MPP-20  and  MPP-50  is  that 
regardless  of  the  MOE,  a  one-year  MA  forecasting  model  is  superior  to  the  other  models 
analyzed  in  this  study. 
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